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A method of fulfilling an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents, comprising the steps of: 

receiving a query comprised of one or more fully specified terms and an 
information need, wherein the information need is represented by one or 
more at least partially unspecified terms; 

identifying contexts in the index that contain the one or more fully 
specified terms and zero or more at least partially unspecified terms; and 
locating one or more matches for the information need within the 
identified contexts. 

The method of claim 1, further comprising the step of: 

identifying documents in the index that contain the one or more at least 
partially unspecified terms. 

The method of claim 1, wherein the locating step further comprises: 
converting the query into a finite state machine; and 
matching the finite state machine against the identified contexts. 

The method of claim 3, wherein the finite state machine is a finite state 

transducer. 

The method of claim 3, wherein the finite state machine allows for the appearance 
of fully specified and at least partially unspecified terms in any order in a 
potential matching context. 

The method of claim 3, wherein the fmite state machine allows for one or more 
intervening words between the fully specified and at least partially unspecified 
terms in a potential matching context. 

A method of fiilfiUing an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents, information indicating category 
restrictions that the terms and contexts satisfy, and identifiers of the documents 
and contexts containing the terms, comprising the steps of: 
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receiving a query comprised of one or more fully specified terms and an 
information need and at least a partid restriction on the order that the one 
or more fully specified terms and the information need may appear in a 
potential matching context, wherein the information need is represented by 
one or more at least partially unspecified terms reflecting a category 
restriction; 

identifying contexts in the index that contain the one or more fully 
specified terms and tiie one or more at least partially unspecified terms in 
the specified order; 

locating one or matches for the information need wittiin the identified 
contexts. 

The method of claim 7, fijrther comprising the step of: 

identifying documents in the index that contain the one or more fully 
specified terms and the one or more at least partially unspecified terms in 
the specified order. 

The method of claim 7, wherein the locating step further comprises: 
converting the query into a finite state machine; and 
matching the finite state machine against the identified contexts. 

The method of claim 9, wherein tiie fmite state machine is a finite state 

transducer. 

The method of claim 9, wherein the finite state machine assigns a dissimilarity 
weight to appearances of fiilly specified and at least partially unspecified terms in 
potential matching contexts in orders other tiian that specified in the query. 
The metiiod of claim 9, wherein the finite state machine allows for one or more 
intervening words between tiie fiilly specified and at least partially unspecified 
terms in a potential matching context. 
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13. A method of fulfilling an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents, comprising the steps of: 

receiving a query comprised of one or more Mly specified terms and an 

information need, wherein the information need is represented by one or 

more at least partially unspecified terms; 

converting the query into a Boolean expression; 

identifying contexts identifiers satisfying the Boolean expression; and 

locating one or more matches for the uiformation need within the 

identified contexts. 

14. The method of claim 13, wherein the identifying step further comprises the steps 
of: 

identifying document identifiers satisfying the Boolean expression. 

15. The method of claims 1, 7 or 13, fiirther comprising the steps of: 

converting the query into a fmite state machine; and 

matching the finite state machine against the contents of the contexts 

associated with the identified context identifiers. 

16. The method of claims 1,7 or 13, wherein the contexts are stored as finite state 
machines. 

17. The method of claims 1, 7 or 13, wherein the documents are accessible over the 
Ihtemet. 

18. The method of claims 1, 7 or 13, wherein the documents comprise World Wide 
Web pages. 

19. The method of claims 1, 7 or 13, fiirther comprising the step of: 

accumulating information about the one or more matches as they are 
located. 

20. The method of claim 1 , further comprising the step of: 

assigning a score to a match. 
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1 21. The method ofclaim 20, wherein the score reflects the number of times an 

2 instance of the match is located among the plurality of documents. 

3 22. The method of claim 1, fiuther comprising the step of: 

4 outputting one or more of the matches, or a portion tiiereof, thereby 

5 providing a resixlt for the query. 

6 23 . The method of claim 22, further comprising the step of: 

7 outputting identifiers or locations of one or more of the documents that 

8 contains a match or portion thereof that was ou^ut in the outputting step. 

9 24. The method of claim 23, wherein a location or a document comprises a uniform 

10 resourese locator. 

11 25. The method of claim 23, further comprising the step of: 

1^ 1 2 ranking the documents that contain a match, and wherein the second 

Q 1 3 outputting step comprises outputting the document identifiers or locations 

0 14 ofthe documents that contain a match in an order based on the ranking. 

1 15 26. The method of claim 25, wherein the ranking step comprises ranking a document 

iff 

ly 1 6 based on the number of times a match is located within the document, 

f 17 27. The method of claim 1 , wherein the category restriction comprises a 

ni 18 morphological feature. 

19 28. The method ofclaim l,wherem the category restriction comprises a syntactic 

H 20 feature. 

21 29. The method ofclaim 1, wherein the category resttictioncompnses a computer 

22 program. 

23 30. The method of claim 1 , further comprising the step of: 

24 storing a match or a portion thereof. 

25 31. The meliiod of claim 30, further comprising the step of: 

26 storing a score for the match or portion thereof. 

27 32. The method of claim 1 , further comprising the step of: 

28 storing a plurality of matches or portions thereof 

29 33 . The method of claim 1 , further comprising the step of: 
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storing a score for a plurality of matches or portions thereof. 
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34. 


The method of claim 1, wherein the index comprises locations of terms within 
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documents. 
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35. 


The method of claim 34, wherein the locating step further comprises: 
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determining the location of a term in the query within a document using 
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the index; and 
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locating a match for the query based on the location of the term within the 
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document. 
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36. 


The method of claim 1, further comprising the step of: 
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ranking a plurality of the located matches or portions thereof. 




11 


37. 


The method of claim 36, wherein the ranking step comprises: 
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ranking a located match or a portion thereof based on the content of a 
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plurality of documents identified in the identifying step. 
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The method of claim 36, wherein the ranking step comprises: 
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ranking a located match or a portion thereof based on the content of a 
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majority of documents identified in the identifying step. 
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39. 


The method of claim 36, wherein tiie ranking is based on one or more features 
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selected from the hst consisting of: the location of a match within a document, a 
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weight assigned to a document that contains a match, the age of a document that 
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contains a match, the source of a document that contains a match, and a format 
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feature of a match within a document. 
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The method of claim 36, wherein the rankmg step comprises: 
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ranking a located match or a portion thereof based on the number of times 
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an instance of the match is located within a plurahty of documents 
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identified in the identifying step. 
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41. 


The method of claim 36, wherein the ranking step comprises: 
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ranking a located match or a portion thereof based on the number of times 




28 




an instance of the match is located within a majority of documents 
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identified in the identifying step. 
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The method of claiins 36, 38 or 39, further comprising the step of: 

outputtmg one or more of the located matches, or one or more portions 
thereof, in an order based on the ranking, thereby providing a result for the 
query. 

The method of claim 42, further comprising the step of: 

outputting an indication of the ranking of a located match or portion 
thereof 

A method of fulfilling an information need based on documents stored on a 
computer-readable medium comprising the steps of: 

storing an index identifymg documents containing terms; 

storing contexts for terms, wherein a context occurs in a document; 

receiving a query containing an unspecified portion; and 

identifying one or more matches for the query within the contexts. 
A method of fulfilling an information need based on documents stored on a 
computer-readable medium comprising the steps of: 

storing an index identifying documents containing terms; 

storing contexts, wherein a context occurs in a document; 

storing information to retrieve a Ust of contexts for terms; 

receiving a query containing an at least partially unspecified portion; and 

identifying one or more matches for the query within the contexts. 
The method of claim 45, wherein contexts are preanalyzed. 

A method of fulfilling an information need based on documents and an index 
stored on a computer-readable medium comprising the steps of: 

storing contexts for terms, wherein the context occurs in a document; 

storing information identifying a document in which a context occurs; 

receiving a query containing an unspecified portion; and 

identifying a plurality of matches for the query within the contexts.48. 

A method of generating an index for satisfying an information need firom a 
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1 48. A method of generating an index for satisfying an information need from a 

2 plurality of document stored on a computer-readable medium, comprising the 

3 steps of: 

4 receiving a document; 

5 identifying a context in the document; 

6 linguistically analyzing the context;selecting a term from the document; 

7 determining if there are more terms in the context to select, and if so, 

8 selecting another term from the context until there are no more 

9 terms to select; 

1 0 determining if there are more contexts to identify in the document, 

1 1 and if so, identifying another context in the document and repeating the 
M 12 term selecting step for each term in the context until there are no more 
Q 1 3 contexts to identify; 

J 1 4 determining if there are more documents to receive, and if so, 

jO 1 5 receiving tiie next document and repeating the context identifymg and 

fll 16 term selecting steps until there are no more documents to receive. 

L 1 7 49. The method of claim 48, wherein linguistically analyzing the one or more 

f V 1 8 contexts further comprises identifying category restrictions satisfied by the 

I J 19 context. 

H 20 50. The method of claim 48, further comprising the step of: 

21 storing information indicating the number of contexts stored for a term in 

22 a document, a document identifier associated with the document, and 

23 context identifiers indicating the location of the one or more contexts 

24 stored in the context array. 
25 

26 51. The method of claim 48, wherein the analyzed contexts are stored as finite state 

27 machines. 

28 52. The method of claim 48, wherein the analyzed contexts are stored as graphs. 

29 53 . The method of claim 48, wherein the analyzed contexts are stored as trees. 
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The method of claim 48, wherein the category restrictions comprise 
morphological features. 

The method of claim 48, wherein the category restrictions comprise syntactic 
features. 

The method of claim 48, wherein the category restrictions comprise computer 
programs. 

An apparatus for fulfilhng an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents and information indicating category 
restrictions that the terms and contexts satisfy, comprising: 

memory means that stores computer-executable process steps; and 
a processor that executes the process steps so as to (i)receive a query 
comprised of one or more fully specified terms and an information need, 
wherein the information need is represented by one or more at least 
partially unspecified terms reflecting a category restriction; (ii) identify 
preanalyzed contexts in the index that contain the one or more fully 
specified terms and the one or more at least partially unspecified terms; 
and (ill) locate one or matches for the information need within the 
identified preanalyzed contexts. 
An apparatus for fulfilling an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appemng within a plurality of documents, information indicating category 
restrictions that the terms and contexts satisfy, and identifiers of the documents 
and contexts contaming the terms, comprising: 

memory means that stores computer-executable process steps; and 
a processor that executes the process steps so as to (i) receive a query 
comprised of one or more fully specified terms and an information need 
^d at least a partial restriction on the order that the one or more fully 
specified terms and the information need may appear in a potential 
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matching preanalyzed context, wherein the information need is 
represented by one or more at least partially unspecified terms reflecting a 
category restriction; (ii) identify preanalyzed contexts in the index that 
contain the one or more fully specified terms and the one or more at least 
partially vinspecified terms in the specified order; and (iii) locate one or 
matches for the information need within the identified preanalyzed 
contexts. 

An apparatus for fulfilling an information need employing an index stored on a 
computer-readable medium and comprised of preanalyzed contexts of terms 
appearing within a plurality of documents, information indicating category 
restrictions that the terms and contexts satisfy, and identifiers of the documents 
and contexts containing the terms, comprising: 

memory means that stores computer-executable process steps; and 
a processor that executes the process steps so as to (i) receive a query 
comprised of one or more fidly specified terms and an information need 
and at least a partial restriction on the order that the one or more fiilly 
specified terms and the information need may appear in a potential 
matching preanalyzed context, wherein the information need is 
represented by one or more at least partially unspecified terms reflecting a 
category restriction; (ii) identify preanalyzed contexts and documents in 
the index that contain the one or more fiilly specified terms and the one or 
more at least partially unspecified terms in the specified order by 
converting the query into a Boolean expression and identifying context 
identifiers satisfying the Boolean expression; and (iii) locate one or 
matches for the information need within the identified preanalyzed 
contexts by converting the query into a finite state machine and matching 
the finite state machine against the identified preanalyzed contexts. 
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Computer-executable process steps stored on a computer-readable medium, the 
computer-executable process steps to fulfill an information need, the computer- 
executable process steps comprising: 

code to receive a query containing an unspecified portion, the unspecified 

portion including an unspecified term; and 

code to identify one or more matches for the query within a body of 

information stored on a computer-readable medium. 
Computer-executable process steps stored on a computer-readable medium, the 
computer-executable process steps to fulfill an information need, the computer- 
executable process steps comprising: 

code to store contexts for terms, wherein a context occurs in a document, 

code to store information identifying a document in which a context 

occurs; 

code to receive a query containing an imspecified portion; and 

code to identify one or more matches for the query within the contexts. 
Computer-executable process steps stored on a computer-readable medium, the 
computer-executable process steps to fidfill an information need, the computer- 
executable process steps comprising: 

code to identify a plurality of matches for a partially unspecified query; 

and 

code to rank a plurality of the matches or portions tiiereof. 
Computer-executable process steps stored on a computer-readable medium, the 
computer-executable process steps to fulfill an information need, the computer- 
executable process steps comprising: 

code to identify a plurality of results for a query, the results occurring 

within documents; and 

code to rank the plurality of results based on the content of a plurality of 
documents in which a result is identified. 
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